Search CORE

Detecting complex events in user-generated video using concept classifiers

Author: Guo Jinlin
Gurrin Cathal
Hopfgartner Frank
Scott David
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/06/2012
Field of study

Automatic detection of complex events in user-generated videos (UGV) is a challenging task due to its new characteristics differing from broadcast video. In this work, we firstly summarize the new characteristics of UGV, and then explore how to utilize concept classifiers to recognize complex events in UGV content. The method starts from manually selecting a variety of relevant concepts, followed byconstructing classifiers for these concepts. Finally, complex event detectors are learned by using the concatenated probabilistic scores of these concept classifiers as features. Further, we also compare three different fusion operations of probabilistic scores, namely Maximum, Average and Minimum fusion. Experimental results suggest that our method provides promising results. It also shows that Maximum fusion tends to give better performance for most complex events

Semantic concept detection in imbalanced datasets based on different under-sampling strategies

Author: Foley Colum
Guo Jinlin
Gurrin Cathal
Lao Songyang
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/07/2011
Field of study

Semantic concept detection is a very useful technique for developing powerful retrieval or filtering systems for multimedia data. To date, the methods for concept detection have been converging on generic classification schemes. However, there is often imbalanced dataset or rare class problems in classification algorithms, which deteriorate the performance of many classifiers. In this paper, we adopt three “under-sampling” strategies to handle this imbalanced dataset issue in a SVM classification framework and evaluate their performances on the TRECVid 2007 dataset and additional positive samples from TRECVid 2010 development set. Experimental results show that our well-designed “under-sampling” methods (method SAK) increase the performance of concept detection about 9.6% overall. In cases of extreme imbalance in the collection the proposed methods worsen the performance than a baseline sampling method (method SI), however in the majority of cases, our proposed methods increase the performance of concept detection substantially. We also conclude that method SAK is a promising solution to address the SVM classification with not extremely imbalanced datasets

Searching for videos on Apple iPad and iPhone

Author: Foley Colum
Guo Jinlin
Gurrin Cathal
Paul Ferguson
Scott David
Publication venue
Publication date: 01/07/2011
Field of study

In this demonstration we introduce our content-based video search system which runs as an app on the Apple iPad or iPhone. Our work on video search is motivated by the need to introduce content-based video search techniques, which are currently the preserve of the research community, to the larger YouTube generation. It was with this in mind, that we have developed a simple but engaging content based video search engine which uses an iPad or iPhone app as the front-end user interface. Our app supports the three common modes for content-based video search: text search, concept search and image-similarity search. Our iPad system was evaluated as part of the TRECVid 2010 evaluation campaign where we compared the performance of novice versus expert users

Localization and recognition of the scoreboard in sports video based on SIFT point matching

Author: Foley Colum
Guo Jinlin
Gurrin Cathal
Lao Songyang
Smeaton Alan F.
Publication venue
Publication date: 01/01/2011
Field of study

In broadcast sports video, the scoreboard is attached at a fixed location in the video and generally the scoreboard always exists in all video frames in order to help viewers to understand the match’s progression quickly. Based on these observations, we present a new localization and recognition method for scoreboard text in sport videos in this paper. The method first matches the Scale Invariant Feature Transform (SIFT) points using a modified matching technique between two frames extracted from a video clip and then localizes the scoreboard by computing a robust estimate of the matched point cloud in a two-stage non-scoreboard filter process based on some domain rules. Next some enhancement operations are performed on the localized scoreboard, and a Multi-frame Voting Decision is used. Both aim to increasing the OCR rate. Experimental results demonstrate the effectiveness and efficiency of our proposed method

TRECVid 2011 Experiments at Dublin City University

Author: Foley Colum
Guo Jinlin
Gurrin Cathal
Hopfgartner Frank
Scott David
Smeaton Alan F.
Publication venue
Publication date: 01/01/2012
Field of study

This year the iAd-DCU team participated in three of the assigned TRECVid 2011 tasks; Semantic Indexing (SIN), Interactive Known-Item Search (KIS) and Multimedia Event Detection (MED). For the SIN task we presented three full runs using global features, local features and fusion of global, local features and relationships between concepts respectively. The evaluation results show that local features achieve better performance, with marginal gains found when introducing global features and relationships between concepts. With regard to our KIS submission, similar to our 2010 KIS experiments, we have implemented an iPad interface to a KIS video search tool. The aim of this year’s experimentation was to evaluate different display methodologies for KIS interaction. For this work, we integrate a clustering element for keyframes, which operates over MPEG-7 features using k-means clustering. In addition, we employ concept detection, not simply for search, but as a means of choosing most representative keyframes for ranked items. For our experiments we compare the baseline non-clustering system to a clustering system on a topic by topic basis. Finally, for the first time this year the iAd group at DCU has been involved in the MED Task. Two techniques are compared, employing low-level features directly and using concepts as intermediate representations. Evaluation results show promising initial results when performing event detection using concepts as intermediate representations

Evaluating novice and expert users on handheld video retrieval systems

Author: Guo Jinlin
Gurrin Cathal
Hopfgartner Frank
Scott David
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

Content-based video retrieval systems have been widely associated with desktop environments that are largely complex in nature, targeting expert users and often require complex queries. Due to this complexity, interaction with these systems can be a challenge for regular ”novice” users. In recent years, a shift can be observed from this traditional desktop environment to that of handheld devices, which requires a different approach to interacting with the user. In this paper, we evaluate the performance of a handheld content-based video retrieval system on both expert and novice users. We show that with this type of device, a simple and intuitive interface, which incorporates the principles of content-based systems, though hidden from the user, attains the same accuracy for both novice and desktop users when faced with complex information retrieval tasks. We describe an experiment which utilises the Apple iPad as our handheld medium in which both a group of experts and novice users run the interactive experiments from the 2010 TRECVid Known-Item Search task. The results indicate that a carefully defined interface can equalise the performance of both novice and expert users

Crossref

TRECVid 2012 experiments at Dublin City University

Author: Albatal Rami
Gurrin Cathal
Hopfgartner Frank
Jinlin Guo
Scott David
Smeaton Alan F.
Zheng Xing
Publication venue
Publication date: 01/10/2012
Field of study

Following previous participations in TRECVid, this year, the DCU-IAD team participated in four tasks of TRECVid 2012: Instance Search (INS), Interactive Known-Item Search (KIS), Multimedia Event Detection (MED) and Multimedia Event Recounting (MER)

Helping the helpers: How video retrieval can assist special interest groups

Author: Guo Jinlin
Gurrin Cathal
Hopfgartner Frank
Scott David
Wang Hongyi
Yang Yang
Zhang Zhenxing
Zhou Lijuan Marissa
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

Given the increasing broadcasting data and the ever decreas- ing spare time that we can spend on consuming this data, systems are required that assist us in identifying important content. Following a use case of a fictional social worker, we introduce a video retrieval system that is designed to assist special interest groups in their information gathering task

DCU at MMM 2013 video browser showdown

Author: Guo Jinlin
Gurrin Cathal
Hopfgartner Frank
McGuinness Kevin
O'Connor Noel E.
Scott David
Smeaton Alan F.
Yang Yang
Zhang Zhenxing
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

This paper describes a handheld video browser that in corporates shot boundary detection, key frame extraction, semantic content analysis, key frame browsing, and similarity search

Crossref